Feeds to Scour
SubscribedAll
Scoured 15857 posts in 1.17 s
PDFInspect: A Unified Feature Extraction Framework for Malicious Document Detection
arxiv.org·14h
📄PDF Forensics
Preview
Report Post
Show HN: We built an OCR API to stop babysitting extraction pipelines
news.ycombinator.com·1d·
Discuss: Hacker News
👁️Constructive OCR
Preview
Report Post
Adobe Acrobat can now generate presentations and audio podcasts from your documents
engadget.com·5h
📄PDF Internals
Preview
Report Post
From Paginā to Webpage: On Developing and Documenting a Digitized Latin Collection (Journal of Open Humanities Data)
rbfirehose.com·1d
📜Manuscript TEI
Preview
Report Post
Best Data Extraction Tools in 2026: An AI-First Guide for Data, LLM, and RAG Systems
dev.to·11h·
Discuss: DEV
🤖Archive Automation
Preview
Report Post
Redacting Faces, People, Vehicles, and Plates with Amped Replay Assisted Redaction
blog.ampedsoftware.com·4h
🧪Archive Fuzzing
Preview
Report Post
From Text to Map: A Reproducible Geocoding Pipeline for Ottoman Studies
digitalorientalist.com·1d
📜Palimpsest Analysis
Preview
Report Post
Patterns All the Way Down: A Generalization for Graph-Like Things
medium.com·3h·
Discuss: Hacker News
🤝Unification Algorithms
Preview
Report Post
Extracting Embedded Images from a PDF
wadetregaskis.com·1d·
Discuss: Hacker News
📄PDF Archaeology
Preview
Report Post
PDFTextor – Offline PDF Text Extractor (Windows EXE + Full Source Code)
gum.new·1d·
Discuss: Hacker News
📄PDF Internals
Preview
Report Post
Docs2Synth: A Synthetic Data Trained Retriever Framework for Scanned Visually Rich Documents Understanding
arxiv.org·14h
📝Document Chunking
Preview
Report Post
Exploring Text Compression
denvaar.dev·16h
📝Text Compression
Preview
Report Post
Databases are magic ... until ...
silvestreperret.com·10h·
Discuss: Hacker News
🗄️Database Internals
Preview
Report Post
Merge-pdf.app – A free, privacy-first PDF Merging tool
ryansouthgate.com·11h·
Discuss: Hacker News
📄PDF Archaeology
Preview
Report Post
FEATURE - Building Frameworks for Long-Term Digital Preservation
infotoday.com·23h
🏛️OAIS Implementation
Preview
Report Post
Building a PDF Workflow with AI-Powered OCR in 2026
dev.to·4d·
Discuss: DEV
📄Document Streaming
Preview
Report Post
SysTools PDF Media Remover Software
hackster.io·2d
📄PDF Forensics
Preview
Report Post
You Probably Don’t Need a Vector Database for Your RAG — Yet
towardsdatascience.com·1d
🗂️Vector Databases
Preview
Report Post
Interpreting MARC records
folgerpedia.folger.edu·2d
📚MARC Records
Preview
Report Post
Explainable unsupervised query tagging
emiruz.com·4d·
Discuss: Hacker News
🔍Information Retrieval
Preview
Report Post

Keyboard Shortcuts

Navigation
Next / previous item
j/k
Open post
oorEnter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help